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Abstract. For every finite generating set on the integer Heisenberg group H(Z), 
Pansu showed that the word metric has the large-scale structure of a Carnot- 
Caratheodory Finsler metric on the real Heisenberg group H(M.). We study the 
properties of those limit metrics and obtain results about the geometry of word 
metrics that reflect the dependence on generators. 

For example we will study the probability that a group element has all of 
its geodesic spellings sublinearly close together, relative to word length. In free 
abelian groups of rank at least two, that probability is 0; in unbounded hyperbolic 
groups, the probability is 1. In H(1i) it is a rational number strictly between 
and 1 that depends on the generating set; with respect to the standard generators, 
the probability is precisely 19/31. 



1. Introduction 

In this paper we will focus on the 3-dimensional integer Heisenberg group H(Z), 
a uniform lattice in the real Heisenberg group H(R), with the word metric coming 
from a finite generating set. We will study large-scale geometric properties that are 
sensitive to the choice of generators, such as the shape and stability of long geodesies, 
and asymptotic density of classes of elements. 

We will take an approach based on work of Pansu and Breuillard [HI [5] exploit- 
ing the fact that all word metrics on H(Z) have the large-scale structure of Carnot- 
Caratheodory Finsler metrics on -ff(K); that is, the Lie group with its CC metric is 
the asymptotic cone of the lattice with the chosen word metric. However, we will 
study large-scale geometric properties that are lost when one passes to the asymptotic 
cone. (Note that studying the Heisenberg group up to quasi-isometry is even coarser, 
identifying all word metrics as well as all left-invariant metrics on H(R) with one an- 
other.) Thus we develop technology for working with these word metrics that uses the 
homogeneous dilation at finite scale together with combinatorial arguments, instead of 
the usual tools of coarse geometry. We think of this set of questions and techniques as 
belonging to "fine asymptotic geometry." 

The geometry of nilpotent groups is closely tied to geometric minimax problems, 
so Section [2] is devoted to the solution of relevant isoperimetric problems in normed 
planes. In Section |3j we apply those findings, giving a very explicit description of the 
limit metric associated to an arbitrary word metric. The main result of this section is 



Theorem 10 which is summarized here: 



Structure Theorem. For any finite symmetric generating set S of -ff(Z), the limiting 
CC metric admits a complete description of its geodesies, classified into no more than 
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| jS'I 2 — 2 1 | combinatorial types of regular geodesies and no more than \S\ types of unsta- 
ble geodesies. The unit sphere S in the CC metric is a piecewise union of the graphs 
of quadratic polynomials over finitely many quadrilateral regions in the xy-plane. 

As a corollary of this description, we can analyze the uniqueness of geodesies in 
these limit metrics, describing exactly which points x € H(R) are reached by more 
than one geodesic segment based at 0. 

We use this structure theorem in Section [4] to understand the geodesies in Cayley 
graphs for i?(Z), showing that word geodesies may be very unruly, but they are tracked 
to within a controlled distance by much better-behaved CC geodesies: the Hausdorff 



distance is of lower-order growth than the length of the word. (See Lemma 18 ) 



Tracking Lemma. Away from a certain unstable locus of endpoints, word geodesies 
are sublinearly tracked by CC geodesies. 

This opens the door to the study of geometric probability in the discrete Heisenberg 
groups, and we give an illustrative application in Thcorem |22[ quantifying the instabil- 
ity of word geodesies. Elements of a group may be said to be geodesically stable with 
respect to a generating set if, in the Cayley graph, the Hausdorff distance between 
the geodesic spellings of the word can only differ from each other sublinearly in the 
length of the word. When this is made precise, one can see that stable elements have 
zero density in free abelian groups, but full density in infinite hyperbolic groups. The 
situation is different in the Heisenberg group. 

Instability Theorem. The asymptotic density of geodesically stable elements ofH(Z) 
is a rational number strictly between and 1, depending on the generating set. 

Then in Section |5l we apply the structure theorem to a generalized Gauss circle 
problem: Theorem^] asserts that in every polygonal CC metric — a class that includes 
all limits of word metrics — we can count lattice points to first order in the annular 
shell between balls. 

Counting Theorem. For any polygonal CC metric, the number of lattice points be- 
tween the spheres of radius n and n-1 is equal to 4Vn 3 + 0(n 2 ) , where V is the volume 
of the CC unit ball. 

Note that this is strictly better than what is logically implied by the best known 
estimate of the number of lattice points in the ball of radius n, which is Vn 4 + 0(n 3 ) 
by [12] • (In fact, the full theorem is more precise than what is stated above: we can 
count points in the annulus in any radial direction.) 

Finally, we show in Section [6] that for S = std, the cardinality of the discrete sphere 
S n agrees with this annular lattice point count to first order. The counting measure on 
discrete spheres is thus shown to limit to a cone measure on the limit shape 5, for which 
we have a concrete and finite description. This enables spherical averaging — sharper 
than asymptotic density calculations, which are averages over balls — with respect to 
the standard generators. 

1.1. Nilpotent groups and the Heisenberg group. We review a few facts about 
the sub-Riemannian geometry of the Heisenberg group; see [5] for a comprehensive 
reference. 

First, we consider the 3-dimensional integer Heisenberg group i/(Z), the subgroup 
of matrices with integer entries in H(R) (the real Heisenberg group). Let m denote 
the horizontal subspace of the Lie algebra f) of H(R); that is, the span of the tangent 
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vectors Xq = |oooj and Yq = ( [j [j J ) • There are horizontal planes at every point 
in H(R), generated by pushing around X and Y by left multiplication in H(R) to 
produce left-invariant vector fields X and Y. We say that a curve in H(M) is admissible 
if all of its tangent vectors lie in these horizontal planes; it is easily verified that the 
curve 7 = (71,72,73) is admissible if and only if 

7 3 = ^(7i7 2 " 727i)- 

We will use the exponential coordinates on H given by the following representation: 

(1 x z + ^xy 
1 y 
1 

These coordinates have the property that (x,y,z) n = (nx,ny,nz). 

For integers x and y, define e(x, y) to be 1/2 if a; and y are both odd, and otherwise. 
In these coordinates, H(Z) looks just like the standard lattice Z 3 c R 3 shifted by e in 
the z direction. 

We can regard m as a copy of K 2 and make use of the linear projection n : H (R) -»■ m 
given by (x, y, z) «■ (x,y). 

Lemma 1 (Standard fact from Heisenberg geometry). For any path 7 = (71,72) from 
(0,0) to (x,y), there is a unique admissible curve 7 = (71,72,73)- The lifted curve 
connects the origin to the point (x,y,z), where z is equal to the signed Euclidean area 
of R, the region in the plane enclosed by 7 and a straight chord from (0,0) to (x,y). 

Proof. To construct the lift, we just need to supply the coordinate function 73, which 
can be done by integrating in the defining formula for admissibility. Note that x(t)y'(t)- 
y(t)x'(t) is identically zero along a linear paramctrization of the straight chord between 
the endpoints, so we can concatenate that chord with the path in the plane to obtain 
the closed curve dR. We get 

z = / 7i 72 " 727i = dxA dy, 

JdR JR 

which is the area of R. □ 

From now on the area of this region R will be called the balayage area associated to 
a curve 7. 

Choose a centrally symmetric convex body Q c m with boundary L and let || • || L 
denote the corresponding norm; this is the norm having Q as its unit ball and L as 
its unit sphere. We will use the notation mi = (m, || • || L ) for this normed plane. This 
induces a Carnot-Caratheodory Finsler metric (from now on, CC metric) on the Lie 
group: the distance between two points is the infimal length of an admissible path 
between them, measured by using the norm on the tangent vectors. (This is well 
defined for admissible curves because the tangent vectors lie in copies of m pushed 
around by multiplication in H(M), and it is a classical fact that this gives H(M.) the 
structure of a geodesic space.) Note that measuring the length of an admissible curve 
in H(R) with respect to this CC metric is equivalent to measuring the length of its 
projection to m with respect to the norm || • || L . If L is a polygon, we will call the 
induced metric a polygonal CC metric on H(M). (In arbitrary dimension, if L is a 
polytope, the associated norms are sometimes called crystalline.) 
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These CC metrics are equipped with a dilation S t (x,y,z) = (tx,ty,t 2 z) that scales 
lengths and distances by t, and areas in m by t 2 . For any set E c H(R), let AE denote 
its full cone under dilation, and let E denote its dilation-cone to the origin: 

AE = {S t (x) :xeE,t > 0} ; E = {<5 t (x) : x e E, < t < 1}. 

As a consequence of the connection between height and balayage area, we have 
a criterion for geodesity in the CC metric: a curve 7 in m based at (0,0) lifts to a 
geodesic in H(R) iff its L-length is minimal among all curves with the same endpoints 
and enclosing the same area. 

We will consider geometric probability, or asymptotic density, in the Heisenberg 
groups; for U c H(Z) and V c H(R), let B t = B t (0) be the ball of radius t about the 
origin. Then 

Prob(EQ:= lim ; Prob(VQ := lim ^11X1 . 

n^oo \B n \ r-nx> VO\(B r ) 

Note that Lebesgue measure has the right invariance properties to coincide with Haar 
measure on H(U.) up to a scalar, so we will be able to talk about ratios of volumes 
unambiguously. 

1.2. Limit shapes, limit metrics, and limit measure. We have the following 
extremely general theorem, first proven for nilpotent groups by Pansu, and extended 
to all periodic pseudometrics on simply connected solvable Lie groups of polynomial 
growth by Breuillard [2]. Here we just state it for the very special case of word metrics 
on H(Z) < H(R). We will always assume that generating sets are symmetric (S = -S). 

Theorem 2 (Pansu 9 ). Consider a word metric on H^Z) given by a finite generating 
set S . Let tt(S) be the linear projection of the generators S to the horizontal subspace 
m, let Q be the convex hull of ir(S) and L its boundary polygon. Then the limit 

S = lim 5xS n , 

n->oo n 

exists (as a Gromov-Hausdorff limit, say) and is equal to the unit sphere in the CC met- 
ric induced by the norm || • || £ on m. 
Equivalently, 

lim Ms = L 

X^OO d C0 (X, 0) 

Theorem 3 (Krat [8 ). Furthermore, there exists a constant K = K(S) such that 
d cc (x,0)-K < \x\ s < d cc (x,0) + K 

for allxeH(Z). 

The second result, bounded difference between the word metric and the CC metric, 
is stronger here in H(Z) than for the general case, where one only has that the ratio 
goes to 1. In fact Breuillard showed in [5] that there exist 2-step nilpotent groups 
where bounded difference fails. 

For the word metric (H(Z), S), we will call S and the induced CC metric the limit 
shape and limit metric, respectively. The volume of the unit ball in the CC metric 
will turn out to be an important number to attach to both the word metric and the 
CC metric itself; we will denote it by 

V=V(S) = V(L) :=vol(«S). 
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The Hausdorff dimension of H(R) is equal to 4, despite the topological dimension of 
3, as one can see clearly by considering the growth of a cube under the dilation. Thus 
the volume of the ball of radius r in the CC metric equals V-r 4 , and in i?(Z), we have 
\B n \ = V-n 4 + 0(n 3 ). 

In the previous paper [7], the case of Z d was studied in detail, and there we also 
obtained a limit measure for a generating set S. In that case Q is the convex hull 
in R d of the generators S themselves, and L is again its boundary. The limit shape 
here is given by L = lim —S n . In [7] it was shown that counting measure on -S n 

n-*oo n ' n 

converges to a measure on L called cone measure, which assigns to a measurable set 
itcL the measure ^ l (<j) = v°i(q) ' (That is, it is the proportion of the area of Q that 
is subtended by er, in this case by Euclidean dilation.) This allows us to reduce the 
problem of averaging over larger and larger S n with respect to counting measure to 
the finite problem of averaging over L against cone measure. 

Below, in Theorem 25 we will obtain the analogous result for -ff (Z) with respect to 
the standard generators, which enables us to calculate spherical averages; conjecturally, 
cone measure is the limit measure for all finite generating sets. 



1.3. Relationship to prior literature on H(Z). Pansu's seminal paper from 1983 
[9], based on his dissertation, shows that the ratio of limit metric to word metric goes 
to one for all virtually nilpotent groups with finite generating sets, as discussed above, 
and concludes that these word metrics on ff(Z) satisfy \B n \ = a-n d + o(n d ). Breuillard's 
preprint from 2007 [3] extends this result (that the growth has a well-defined leading 
coefficient) to a much larger class of groups, and computes the coefficient. Breuillard 
also gives extremely concrete geometric constructions in that paper, especially in the 
Appendix, where some explicit computations are shown for (ff(Z),std) that inspired 
the current work. 

M. Shapiro wrote a study of H(Z) with respect to its standard generators in 1989 
[TO] , giving a description of S n , then computing an exact formula for \S n \ as a poly- 
nomial function of n. This remarkable formula has not only a well-defined leading 
coefficient, but in fact all of the coefficients are computed and only the constant term 
oscillates (in fact, between twelve different values). He uses this study to establish that 
7J(Z) has infinitely many cone types, and is almost convex in the sense of Cannon. 

Stoll 1998 [T2] refines Pansu's asymptotics by showing that for 2-step nilpotent 
groups with finite generating sets, \B n \ = a-n d + 0(n d_1 ), and this error term is "of- 
ten" sharp — compare our counting theorem (Thm [24| . In a separate paper, Stoll had 
showed that 2-step nilpotent groups with infinite cyclic derived subgroup have a finite 
generating set for which the growth series is rational; on the other hand, higher Heisen- 
berg groups (of dimension 5 and up) have a finite generating set for which the growth 
series is transcendental [IT] . These results are recovered by Breuillard by geometric 
methods. 

Blachere 2003 [1 computes an exact word length formula with respect to the stan- 
dard generators, using it to show "almost-connectedness" of discrete spheres S n . We 
note that the calculations of Shapiro and Blachere for S n with respect to S = std are 
somewhat combinatorial, and that they are essentially equivalent to the more geometric 
description of S n given below in |6] (which also give directional information). 

Finally, in work based on her dissertation, Dani 2007 [3] has some striking results 
on asymptotic density: for any virtually nilpotent group, she gives an exact formula 
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for the density of finite-order elements. Further, she shows that the values of these 
densities range over Q n [0, 1] as the groups vary. 

Acknowledgments. Many thanks to Ralf Spatzier, Alex Eskin, and Emmanuel Breuil- 
lard for useful conversations on numerous aspects of the problems considered here. 

2. ISOPERIMETRY AND ISOAREA IN NORMED PLANES 

2.1. The classical problem. As seen above, understanding CC geodesies in iJ(R) 
amounts to the problem of finding paths in m of minimal L- length given their endpoints 
and balayage area. For closed loops, this is just the classical isoperimetric problem in 
the normed space m^, which was elegantly solved by Busemann in the 1940s as follows. 

Let the polar dual Q* of a convex, centrally symmetric polygon Q be defined by 

Q* = {xem: x-y<l VyeQ}- 

Then if u and v are successive vertices of Q, Q* has a vertex a corresponding to the 
edge between them which is the solution to a • u = a ■ v = 1. (That is, if the side is called 
a and the line through the origin perpendicular to a is called £, then the vertex dual 
to a is a vector pointing in the direction of I and whose length is the reciprocal of the 
length of the projection of u or v to £.) In particular, if Q is a polygon with 2N sides, 
then Q* is also a 2AT-gon. 

Theorem 4 (Busemann 4 ). Consider the norm on K 2 induced by a convex centrally 
symmetric body Q. Then the maximal ratio of enclosed area to perimeter is uniquely 
achieved (up to scale) by the isoperimetrix /, which is defined to be the boundary of 
the rotate by tt/2 of the polar dual Q* of Q. 

If Q is a polygon, then the sides of I are parallel to the vertex directions of Q. Here 
is the calculation: suppose a and b are successive vertices of Q* , so they are dual to 
successive edges of Q, sharing a vertex v. Then a side of I is obtained by rotating 
a - b by 7r/2, so that side of I is perpendicular to a - b. But since a • v = 1 and b ■ v = 1 
(by construction of a and b), we have (a - b) ■ v = 0, which shows that the side of / is 
parallel to v. 

Most of the rest of this section is devoted to solving modifications of this problem 
that are relevant for the nilpotent geometry. We will introduce classes of curves called 
trace paths and beelines, then show that these provide solutions to the Dido problem 
and the dual isoarea problem inm^. 

2.2. Trace paths. Let 1% be a scaled copy of /, scaled to have perimeter one in 
the L-norm. Fix a parametrized 1-periodic curve i ■ R -»■ M 2 so that the image of 
i is Ji, the parametrization is by arclength relative to the norm, and i traverses I\ 
counterclockwise. 

For any choice of parameters t € [0, 1] and T e (t,t + 1], let 

l(sT + t-st)~ i(t) 
trace t , r (s) = -i ^-f — , se[0,l]. 

We call these trace paths: they are parametrized paths of length one in the norm, whose 
shape follows along the perimeter of ^pzih (a scaled copy of the isoperimetrix). These 
trace paths start at the origin and therefore must terminate inside the unit ball in 
the norm (namely Q). See Figure [l] for a classification of trace paths for two different 
choices of Q. 
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Definition 5. For a given normed plane xxil, let the balayage function A : Q -»■ R 
denote the maximal balayage area among all paths of length 1 /rom (0,0) to (x,y). 

Lemma 6 (Trace paths solve the Dido problem). Given (x,y) e Q, a path 7 of L- 
length 1 connecting (0, 0) and (x, y) has maximal balayage area among all such paths 
if and only if it is a trace path: 7 = trace^T where tracet i r(l) = (x,y)- 

Proof. Consider a particular trace path with trace tj T(l) = (x,y). Let a be the straight 
chord from (0,0) to (x,y) and let A = Let A tt denote the area enclosed by 

trace t ^ and a; we want to show that this is maximal. Note that the trace path can 
be completed to XI\ (a scaled copy of I) by extending the domain up to s = A. Thus a 
can be realized as a chord of Aii that cuts it into two pieces, and A tr is the area of one 
of those two pieces. But now suppose there were a different path 7 in M 2 , of length one 
in the i-norm, connecting (0,0) to (x,y) while enclosing maximal area A(x,y) > A tr 
with the chord a. Maximality of area implies convexity, so in particular the path 7 
always stays on one side of the line through a. But then 7 can also be concatenated 
with tracet j T([l, A]), producing a figure which must be a copy of A/i, by uniqueness of 
the isoperimetrix. Thus A tr = A(x,y), showing that trace paths enclose maximal area 
among all paths of length one, as desired. Indeed, this also means that the image of 
7 is uniquely determined, which forces it to be a trace path, though it is possible that 
7 = trace ti T = trace t ',T' for another choice of parameters with t' = t + k, T" = T + k. □ 

From here on we will assume that Q is a 27V-gon, in which case I\ will also be a 
27V-gon and 1 will be piecewise affine: t(s) = sa + b for fixed vectors a, b when s is in 
the subinterval of values corresponding to a side of /. Let t be chosen so that t(0) is a 
vertex in a copy of I\ and let ai,a 2 , - ■ ■ , c 2 jv be the sides listed counterclockwise. Let 
li be the sidelcngth of Oi for all i, so that £^ = 1. 



This is the subset of Q consisting of endpoints trace t ,t(1) °f those trace paths whose 
shape on Ii begins on the ith side and ends on the jth side. 

We will say that two continuous, piecewise linear paths have the same shape if there 
exist vector directions Vi, . . . , v& such that each path follows a\W\,a 2 W2, ■ ■ ■ ,ik^k in the 
same order, with ai,dk > 0, 02, ... , ak-i > 0, and + Vj+i for all i. 

Theorem 7 (Combinatorics of trace paths). If Q is a polygon with 2N sides, then there 
are AN 2 - AN quadrilaterals Qij , covering Q and overlapping only on their boundaries, 
such that the trace paths have "constant combinatorics" on each piece: that is, the 
trace paths ending at any two points in Qij have the same shape. 

Proof. We will show that the quadrilaterals Qu and Qi^+i are degenerate (have no 
area in m). Discarding those, there remain (2iV)(2./V-2) nondegenerate quadrilaterals, 
which will be seen to have the properties described in the statement above. 

It will be helpful to note right away that within a given , each level set in t or T is 
a straight line. This is because 1 is affine on each edge. Thus, within the quadrilaterals 
and for fixed T, the function trace is of the form (ia + b)/(c- t), so its derivative is 
(c-t) 2 [ ca + b]i which has constant direction. This shows that t- level sets (and, similarly, 
T-level sets) are a family of straight segments sweeping out each quadrilateral, though 
not in general a parallel family. Let us also consider level sets of trace at T - 1 = k. In 
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Figure 1. The combinatorics of trace paths for a square and a 
hexagon. We can count 8 and 24 nondegenerate quadrilaterals, re- 
spectively, with constant combinatorics on each. 



FINE ASYMPTOTIC GEOMETRY IN THE HEISENBERG GROUP 



9 



this case we have 

i(t + k)-i(t) 
trace M+fe (l) = -i £ ^, 

which produces a parallel family of straight segments within Qij indexed by k. 

We now consider paths starting and ending on the ith side. This quadrilateral Qu 
has two components: either the whole trace path is contained in the zth side, or the 
trace path traverses all the sides of I before returning. Recall that the side a, is parallel 
to a vertex direction from L, say for vertex v^. If the trace path is totally contained in 
(Tj then the image trace^rCl) is the vertex Vj itself, because it points in that direction 
and has length one in the L norm. In the other case, we point in the -v, direction. 
The magnitude of the vectors in this component of Qu range from (achieved when 
T = 1 + 1, so the path closes up) to - l{) (achieved when t is at the end of Uj and 
T has wrapped around to the beginning of ctj). Thus each Qu is a vertex of L together 
with a line segment in the interior of Q. 

Now we will show that Qi,i+i is the edge of L with endpoints Vj, Vj+i. This follows 
simply from noticing that each of those trace paths is a two-sided path following 
direction Vj for time a and then direction Vj+i for time 1 - a, so that it terminates at 
the point av^ + (1 - a)v i+1 . Notice that these particular trace paths are geodesic in R 2 
with respect to the L-norm; and indeed since all trace paths have L-length one, they 
are L-norm-geodesics if and only if they terminate on L itself (the sphere of radius 
one). 

For the remaining quadrilaterals, there are four extreme points: we may start at 
either endpoint of Cj and terminate at either endpoint of o~j. Indeed, there is a quadri- 
lateral traced out by holding the start point or end point on L fixed at one extreme 
while moving the other from extreme to extreme, then alternating which is fixed and 
which is moving, making a circuit of length four. (Its four subpaths are level sets for 
t or T, so they are straight lines.) 

Finally, observe that trace tit+fe (l) is a convex, centrally symmetric polygon for every 
fixed k 6 (0, 1]. This polygon varies continuously in k, is identically zero when k = 1 
and equals L itself for k sufficiently small. We can conclude that all points in Q are 
hit by trace paths. □ 

2.3. Beelines. Let us call a path in R 2 a beeline if it has L-length one and connects 
the origin to a point on L. That is, beelines are L-geodesic segments emanating from 
the origin. One can check that a given path ending interior to a side a of L is a beeline 
by verifying that all of its tangent vectors point towards a. The name reflects their 
consistent progress away from the origin, as opposed to trace paths which typically 
make turns in several directions that seem inefficient in the L-norm. 

Lemma 8 (Existence of beelines). Every point (x,y) € L is reached by a unique trace 
path trace ti T, and this path encloses a nonnegative area A = A(x, y) . There are beelines 
to (x,y) enclosing every signed area in the range [-A, A]. 

Proof. If (x, y) is in the edge of L with extreme points Vj and v i+1 , then the trace path 
to (x, y) follows two directions, first going distance a in direction and then distance 
(1 - a) in direction Vj+i. Now vary s from to a and consider the beeline that goes s 
in direction followed by (1 - a) in direction Vj+i and finally (a - s) in direction Vj. 
This is clearly still an L-geodesic, since it has length one and terminates on L. But 
the area it encloses with the straight chord between its endpoints varies continuously 
from -A when s = to A when s = a. □ 
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In general, the full set of beelines to a side with extreme points and Vj+i consists of 
parametrizations by arclength of arbitrary piecewise differentiable paths whose tangent 
vectors lie in the interval of directions between those two extremes. 

Lemma 9 (Isoarea problem). A curve in Q of L -length 1 has minimal length among 
all curves with the same endpoints and balayage area if and only if it is a trace path 
or a beeline. 

Proof. The reverse implication is easy: we have already shown that trace paths uniquely 
maximize area to their endpoints. Since beelines are L-norm geodesies, there is no 
shorter path between their endpoints. 

Now for the forward implication: Let p be a curve in Q of L-length 1 which has 
minimal length among all curves with the same endpoints and balayage area. If f3 
ends on L, then it is a beeline by definition. Otherwise say /3 ends at (x,y) e Q° 
and has balayage area A'. We know that there is a trace path 7 = trace^T with 
trace f T (l) = (x,y) enclosing area A(x,y) > A'. If A(x,y) = A', then (3 is a trace path 
by Lemma [6j If not, then we can modify 7 to be an improvement on /3: we know that 
the region enclosed by 7 is convex, so we can find a chord of this body starting and 
ending on 7 so that the modified curve 7', which follows 7 except for a shortcut along 
this chord, has length < 1 and encloses area A'. This contradicts the minimality of the 
length of p. □ 



3. Structure of polygonal cc metrics 

3.1. Unit sphere and CC geodesies. Returning to the Heisenberg geometry, we find 
that we have identified the geodesies, and can thus map the shape of spheres. 

Recall that Pansu's theorem tells us that the word metric (H(Z),S) is asymptotic 
to the CC norm on H(R) induced by L, the boundary of the convex hull of ir(S). Note 
that the number of sides of L is at most \S\, but could be smaller if S has several 
elements with the same projection to m, or has elements which do not project to 
vertices of L. If L is a 2./V-gon, then there are AN 2 - AN quadrilateral regions Qij 
classifying the shapes of trace paths, while the 2N sides of L classify the beelines. 

Theorem 10 (Structure Theorem). Consider the CC metric induced on H(R) by a 
polygonal norm \\ ■ \\ L on m. The balayage function A(x,y) is a continuous function 
over Q whose restriction to each Q t j is a quadratic polynomial in x,y. 
Let z(x,y) be the multivalued function given by 

({±A}, (x,y)eQ° 
jt-iM], (x,y)eL. 

Then the graph of z is precisely the unit sphere S in the CC metric. 

Equivalently: the full set of CC geodesies of length one based at the origin is identical 
to the set of admissible lifts of trace paths and beelines ( and their reflections in the 
origin). 

Proof. For points (x, y, z) e H(U.) with z > and (a;, y) € Q° , saying that an admissible 
path a from to (x,y,z) is geodesic means that n(a) is the shortest path in m from 
(0,0) to (x,y) enclosing (nonnegative) balayage area z. By central symmetry of the 
isoperimetrix, we can solve the problem for minimal signed area by traversing the 
isoperimetrix clockwise rather than counterclockwise. Therefore the minimal signed 
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area enclosed by a path of length 1 from the origin to (x,y) is given by -A(—x, -y) = 
—A(x,y), which tells us that the graph of z is symmetric over the xy plane. 

Continuity of A(x,y) on Q is is immediate from the continuity of trace(i,T) in its 
parameters. It only remains to show that the restriction of A(x, y) to each Qij is a 
quadratic polynomial. 

Observe that the problem of finding the trace path to a point (x, y) € Qij amounts 
to finding scalars s±,s, s 2 (with Si,s 2 i s) such that 

(x,y) = s 1 w l + s(w i+1 + ••• + Wj_i) + s 2 Wj, 

where Wj are the vectors defining the sides of 7, as in Figure [2j 




SW. 1+ i 



Figure 2. The balayage area of a path that follows along a scaled 
copy of the isoperimetrix is given by a quadratic expression in si, s 2 , s. 



The area enclosed is equal to that of a polygon of fixed shape with j - i sides of 
length proportional to s, so that its area is proportional to s 2 , plus two triangles of area 
proportional to s% • s and s 2 • s, respectively. There are three linear equations relating 
si, S2, s, given by the total arclength equaling one and the Euclidean displacement be- 
tween endpoints equaling x and y in the horizontal and vertical direction, respectively. 
Thus we can solve to get expressions for Si,s 2 ,s that are linear in x, y; this means that 
the area is quadratic in x, y, as required. □ 

By the definition of a CC metric, the sphere of radius t based at is precisely 5tS 
(so S is homeomorphic to a 2-sphere), and the spheres with other centers are obtained 
by pushing these spheres around by left-multiplication in H(M). 

We want to break down the sphere, which is the graph of z over Q, into the graphs 
over the various quadrilaterals Qij. We will call these the panels of the sphere, and 
define them by 

Panely := {(x,y,z) : (x,y)eQi 3 , z = ±A(x,y)}, + 

and 

Panel M+ i := {(x,y, z) ; (x, y) e Q i)i+1 , ze[-A,A]}, 

so that S = Ui<i,j<27V Panel jj. Note that we can leave out the Panels without changing 
this equality, because they are contained in the union of the other panels. The Panelj^+i 
project to edges of L, and the fact that z is quadratic over the interior implies that 
they are bounded above and below by parabolas in a plane through that edge that is 
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perpendicular to m. We will call these the side panels of S (these are the panels cut 
away in Figure [3]) . 

The combinatorics of trace paths on quadrilaterals dictates the shape of CC geodesies 
in the following straightforward way. Let the footprint map Foot : H (M) -*■ Q be defined 
by 

Foot(x) := 7T o Sy d (x) = Sy d o tt(x), 

where d = rf cc (x, 0). Note that all geodesies from to any point are just dilations of 
geodesies from to S. These in turn are lifts of trace paths and beelines, whose shape 
is determined by which quadrilateral contains the projection from S to Q. That is, 
the shapes of all possible geodesies from to x are determined by which quadrilateral 
contains Foot(x). 

Let the regular points in the unit sphere be the union of the panels over the quadri- 
laterals interior to Q and the unstable points be those in side panels. 

Sre g = U Panels ; <S uns = (J Panel M+ i , 

so that S = S reg uS uns . Recall that the points of Q° are reached only by trace paths, 
while points of L are reached by families of beelines. We have 

AS reg = Foot _1 (Q°) ; AS uns = Foot _1 (i). 

That is, the regular/unstable distinction divides the space H(M.) according to which 
of the two types of geodesic reaches each point. 




Figure 3. Two views of the limit shape S for the hexagonal gen- 
erators, with the quadrilateral decomposition of the footprint Q re- 
produced for reference. Only the regular part, <S reg , is shown. The 
unstable part, <S un5 , is cut away in the picture: it is made up of six flat 
panels perpendicular to the sy-plane that are bounded by parabolas. 

Let the volume subtended by a panel be the volume of the region obtained by coning 
off to the origin by dilation. We will see that the the decomposition of the sphere into 
regular and unstable points gives us useful invariants coming from volume: 

Vreg = vol(5 reg ) ; Vuns = vol(<S uns ). 

Recalling that V is the volume of the unit ball, we have V = V re g + Vuns- 
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Remark 11 (Rationality). We note that Breuillard carried out a full description for 
the standard generators of the kind given in this section, and indicated key elements 
of such a description for general 5, in [2J Prop 9.1]. For instance, it is stated without 
argument there that the balayage function should be piecewise quadratic. Breuillard 
sketches an argument that V (the volume of the unit ball for any limit metric) is ratio- 
nal, noting that Q has integer vertices, so its polar dual and therefore the isoperimetrix 
I must have rational vertices (since Q* = {x e m : x - y < 1 VyeQ}). Following this 
line in our language, we see that the vectors have rational projections to each coor- 
dinate direction as well as rational length in the L-norm (though not in the Euclidean 
norm), and so the linear relations between x,y and si,S2,s described in the proof of 
Theorem [lO] are rational, which ensures that the balayage area over Qij is a rational 
quadratic polynomial in x and y. Furthermore, the vertices of the quadrilaterals Qij 
are rational (when a trace path begins and ends at a vertex, and has arclength 1, then 
each coordinate of the endpoint is a sum of rational numbers, scaled by a rational num- 
ber). Thus the volume subtended by each panel is given by an integral over a region 
bounded entirely by graphs of rational quadratic polynomials (both the balayage area 
and the tracks of the dilation) over rational polygonal domains, so each such volume 
is in fact rational. 

Remark 12 (Comparing metrics). Notice the following fact that easily follows from 
Pansu's theorem from this geometric description of limit shapes. For an arbitrary 
generating set S of H(Z), despite the fact that the word-length of generators is one 
by definition and the CC metric is the limit of the rescaled word metrics, it is easily 
possible for elements of S to be artibrarily far from in CC distance: one can have 
d cc (a, 0) » 1 = ]a|s for some a e S, meaning that the generator lies far outside of S. This 
is because d cc depends only on n(S) and not on the z-coordinates of the generators. 
However, if 7r(a) is not a vertex of L, we will eventually have d cc (a™, 0) < |a™|s because 
the z-coordinate of a" grows only linearly, while the height of S n S over ir(5 n a) grows 
quadratically in n. It is only over the vertices of L that the height of S is zero. In 
that case, we see that d cc (a™, 0) < |a n |s + 1 for sufficiently large n, because the dilation 
causes the curves defining the sphere near its corners to get more and more vertical as 
the radius gets large. This establishes that there is a constant K = K(S) such that 

d oc (x,0) < \x\ s + K 

for all x e ff(Z), which is half of Krat's bounded difference theorem (Thm|3|. 

3.2. Nonuniqueness and regular points. The union of the degenerate quadrilat- 
erals has a geometric significance. Recall that the quadrilateral Qu is a line segment 
based at the origin along with a vertex. Let denote the other endpoint of the line 
segment emanating from 0. The other degenerate quadrilaterals cover the boundary 
L, whose vertices are called v.;. Then, with the convention that Q2N,2N+i = Q2N.i1 let 

N := U Q^^ S -W^} L = U Qi,*H ^ fa} = iA {v<}. 

l<i<2AT l<i<2N 

Now define N := N u L . 

Proposition 13 (Uniqueness of CC geodesies). A point x e S is reached by more than 
one geodesic of length one based at the origin if and only if 7r(x) € N. 

Thus a point x 6 H(M.) is reached by more than one geodesic based at the origin if 
and only if Foot(x) e N. 
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Proof. First, we consider the case of trace paths. It is impossible for a convex polygon 
(in this case XIi) to have two different interior chords which are parallel, have the 
same length, and subtend the same perimeter on each side. Thus the only way this 
non-rigidity can occur is for the chord a to be contained in a side of XI\. But this is 
precisely the case when the start and end points of trace fall on the same side, which 
was examined in the discussion of Qu. The vertices of L have trace paths enclosing 
zero area, so the geodesies are unique in that case. Finally, if 7r(x) is interior to a side 
of L, then geodesies are lifts of beelines that are not straight lines. These can always 
be perturbed to nearby curves with the same endpoints, length, and enclosed area. □ 

From now on we call N the nonuniqueness locus in Q. 

Theorem 14 (Probability of unique geodesies). In H(R) with the CC metric induced 
by a polygon L, the asymptotic density of unique geodesies equals V re g/V . 

That is, choose x uniformly in the ball of radius r and consider the probability that 
there is only one geodesic from to x. For every r, this probability equals Vreg/V ■ 

Proof. The only instances of nonuniqueness occur over N, as we have seen. But 
Foot -1 (No) has no volume, and none of the volume of A (Suns is contributed by the 
vertices. Thus the probability of being reached by distinct geodesies is precisely the 
proportion of the volume of the unit ball that is in the unstable part (i.e., subtended 
by the side panels). □ 

Example 15 (Volume calculations in i 1 norm). Let || • ||l be the I 1 norm on m and 
give H(R) the corresponding CC metric. We find that the volume subtended by panels 
with 4-sided combinatorics is 13/216, the volume subtended by 3-sided panels is 11/54, 
and the volume subtended by the side panels is 1/6. Thus Vreg = 13/216 + 11/54 = 19/72 
and V uns = 1/6. This adds up to give the total volume of the unit ball as V = 31/72, 
which agrees with calculations by Breuillard [2 and Stoll fTT] of the volume growth. 

Thus for this standard polygonal CC metric, the density of points reached by nonuniquc 
geodesies is 12/31, or about 39%, versus 19/31 « 61% for unique geodesies. 

4. Geodesics in the word metric 

4.1. Comparison of word and CC geodesics. Recall that the Hausdorff distance 
dH3us{ct, f3) between a pair of sets is the smallest e > such that each set lies in the 
e-neighborhood of the other. Below, we will talk about paths and the images of those 
paths interchangeably, to make sense of the Hausdorff distance between paths. The 
goal of this section is make precise the following statement: "CC geodesics approximate 
word geodesics." 

Lemma 16 (Continuity of CC geodesics). Fix an arbitrary p > 0. Suppose we are given 
x e S such that 7r(x) f JV p (Nq) and a CC geodesic a from to x. Then for every e > 
there exists 5 > such that whenever y € S with d cc (x, y) < 5, there exists a CC geodesic 
(5 from to y with dHaus(a, P) < £• 

Proof. This is clear for points interior to any regular panel. For points x in the bound- 
aries of regular panels, one can see from the combinatorial description that a sequence 
of points approaching x from any panel must have geodesics approaching the unique 
geodesic to x (see Figure [I]). Suppose x is in the interior of a side panel and y is close to 
x. If 7r(x) = 7r(y), then homotopc n(a) to a path [3 with the same endpoints such that 
the tracks of individual points are small. As long as every tangent vector of every point 
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on every intermediate path points towards the same side, this is a homotopy through 
beelines. If 7r(x) + 7r(y), then we can concatenate this homotopy with another simi- 
lar homotopy ending at a path from the origin to 7r(y) without changing area much. 
Either way, lift /3 to a geodesic (3 ending at y which is close to a. □ 

This continuity statement is false if 7r(x) e No- For instance, take x to be the point 
on S above and {y ra } to be a sequence of points interior to a panel, converging to 
x. Then for each n, there is a unique geodesic 7„ from to y„; these converge to a 
particular one of the many geodesies from to x, which include the lifts of trace*. t+i for 
all t € [0, 1]. These other geodesies to x are thus not closely approximated by geodesies 
to the y„ even as y„ -> x. 

Let S be a finite generating set for H(Z), and K be the maximum of d cc (0, a) over 
all generators a € S. Minimal-length spellings w = ay-a n written in letters from S may 
be realized as discrete geodesies xo,Xi, ...,x„ in H(M) where xo = 0, Xj = ai"-aj, and the 
CC distance between successive points is no more than K. We will say that the discrete 
geodesic Xo,Xi, ...,x„ is e-linearly tracked (or just e-tracked) by an admissible path a if 
they have the same endpoints and the set {x,} stays inside the en-neighborhood of a. 

Lemma 17 (Geodesic spellings vs. admissible paths). For any e > and x e H(Z) 
with sufficiently large n = \x\, every discrete geodesic from to x is e-linearly tracked 
by an admissible path of length n + 0(\fn). 

Proof. Let z(x) denote the height (i.e., the z-coordinate) of x e H(R), and M := 
m&xz(S). Let Xo,...,x„ be a discrete geodesic from to x e H(R) in letters from S. 
Define a polygonal path 7 in m by connecting vertices 7r(xo), . . . , 7r(x„) with straight 
lines. 

The curve 7 has a unique admissible lift 7; its endpoints are at and at a point 
y e H (R) which lies in the same vertical line as x. Since tt(S) c Q, the curve 7 lies in 
nQ, and since it is made up of straight segments of L- length < 1, the length of 7 (and 
hence 7) is bounded above by n. We will establish the following equation: 

n 

(t) z(x) = z(y) + ^z( ai ) 

i=l 

This says that the height of x is the balayage area of 7 plus the heights of all of the 
letters used in the spelling. 

Equation ([f]) can be proven by induction. The group law, in exponential coordinates, 
says 

/ \r 1 1 >\ ( , ' 1 1 , x y ~ V x I 

{x,y,z){x ,y ,z) = Ix + x , y + y , z + z + I. 

In particular, if x„_i = (x,y,z) and a„ = {x' 1 y' ,z'), then we see that the change in 
height between x„_i and x„ = x„_ia„ is equal to the height of a„ plus the area of an 
appropriate triangle in m. That means that the height of the spelling path differs 
from the height predicted by balayage area by precisely the height of the generators, 
as claimed in ([f]). 

Now set C = d cc ((0, 0, M), 0), so that 

4c(0,(0,0,Mn)) = d cc (0,^(0,0, A/)) = Cy/n. 

Let a be a path obtained by concatenating 7 with a CC geodesic from y to x. From 
|f]) it follows that 

\z(xi) - z(yi)\ < Mi < Mn, 
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for all i, where yi is the point on 7 corresponding to the vertex x^. Therefore d cc (x^, y^) < 
C%/n, so the discrete geodesic lives in the C\fn- neighbor hood of a, which means that 
the path Cn _1 ^ 2 -tracks the spelling. Furthermore, the difference in lengths between a 
and 7 is bounded by C\fn. Thus the length of a is bounded above by n + C\Jn. On 
the other hand, Theorem [3] provided a global constant C such d cc (0,x) > n-C . This 
proves that the length of a is n + 0{s/n). □ 

Lemma 18 (Tracking Lemma). For every e > 0, p > 0, and sufficiently large x € H(Z), 
if Foot(x) £ A/^No), then every discrete geodesic from to x is e-linearly tracked by a 
CC geodesic. 

Proof. Suppose otherwise; then there is an e > such that for every n > 0, there 
exist points x„ e H(Z) such that Foot(x n ) i A/" p (No), |x n | > n and there are choices of 
discrete geodesies from to x„ which stay e|x n |-far away from any CC geodesies to the 
same point. Let a n be the admissible paths approximating these discrete geodesies, as 
described above. 

Denote x„ = 5 1 /\ Xn \x n and a„ = Sij\ Xn \a n . The lengths of the a n are converging to 1 
by Lemma 17 By passing to a subsequence, we may assume that the x n converge to a 
limit x € S with 7r(x) £ jV p (No), and that the paths a n converge in Hausdorff distance 
to a geodesic a from to x by Arzela-Ascoli. By Lemma [l6j there is a large enough 
n such that there is a geodesic (3 n within a Hausdorff distance of e/2 from a, and for 
large enough n we know that a n is within e/2 of a. But then for /3„ = S\ Xn \P n , we have 
^Haus(an,/3 n ) < e|x„|. So the discrete geodesies are e-tracked by the CC geodesies f3 n , 
giving us a contradiction. □ 



A key idea in above proof is that additive quasi-geodesics get close to true geodesies 
as the additive constant goes to zero. Again, this fails near Nq. 



(n, n) 
(0,0)/ 




Figure 4. In these diagrams in (m, i 1 ) , we see a true geodesic a n from 
(0, 0) to (n, n) on the left and a path f3 n with different combinatorics, 
enclosing the same area but with excess length, on the right. When 
these are rescaled so that their lifts terminate on the sphere of radius 1, 
the ratio of lengths goes to 1 while the Hausdorff distance is bounded 
away from zero. 



Example 19. Let x„ = (n,n, n A - \n 2 ). There is a unique CC geodesic a n from the 
origin to x„, and its projection to the horizontal plane is shown in Figure [4] on the 
left. On the right is pictured a non-geodesic path j3 n to x„ which has length An 2 (while 
the true geodesic has length L n = in 2 - 2n). Let y„ = Si/ Ln x n be the dilate of x„ that 
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lies in the unit sphere S. Note that y„ -> (0,0, Vi6) as n -* oo. By scaling back the 
paths (3 n by the same factor, we obtain paths whose lengths approach 1. However, 
the Hausdorff distance between a n and (3 n is n 2 - n, so after scaling they still have a 
Hausdorff distance of at least 1/4. In the limit as n -*■ oo, both kinds of paths limit to 
valid closed geodesies to (0,0, V 16 )- But for any finite value of n, the path with the 
wrong combinatorics is far from any geodesic. 

4.2. Instability of geodesies in H(R) and H(Z). 

Definition 20. For a geodesic space X with basepoint xq, let G(x) be the set of all 
geodesies from xq to x. Then the instability function I(x) measures how far apart they 
can be, relative to the size of x: 

diam£(:r) 2 

I(a):=2 -^^ = ^- a ,S p (.) dHM(a ^ ) ' 

where the metric on Q{x) is given by Hausdorff distance. 
We will say that a point x is e-stable ifl(x) < e. 

In general, diam^(x) < d(x,Xo)/2, because any point on a geodesic from x to x 
is within d(x,xo)/2 of one of the endpoints, and therefore within that distance of any 
other geodesic. Thus the factor of 2 in the definition normalizes I so that < l(x) < 1 
for any X, Xq, x. 

Note that if X has unique geodesies, then 1 = 0. If X is <5-hyperbolic, then 
diam£/(x) < S for all x, so l(x) -*■ as x -*■ oo. 

The asymptotic behavior of this instability function is far from being quasi-isometry 
invariant. Obviously I = on the Euclidean plane M 2 , but the situation is quite different 
for M 2 with a polygonal norm. For instance, in the I 1 norm, the elements along the x 
and y axes have 1 = 0, but the diagonal has I(a, a) = 1. It is easily verified that with any 
generating set, the limit shape L divides the plane into sectors, and the instability is 
on the boundaries of the sectors and 1 on the midlines, varying linearly between these 
extremes along the sides of L. In particular one can recover the vertex directions of L 
from the function I. Since the word metric on (Z 2 , S) is asymptotic to rrii = (R 2 , || • \\ L ), 
it is also true for word metrics that I detects the directions of significant generators. 

The goal of this section is to study instability in the Heisenberg group. As one would 
expect, H(Z) is more geodesically stable than Z 2 but less stable than a hyperbolic 
group. More intriguingly, the stability depends on the generating set in a different way 
than for those other groups. 

We will call an element H(Z) regular if it is regular as an element of H(R). 

Lemma 21 (Stability of regular points). For every e > and p > 0, all sufficiently 
large regular points x e H (Z) with Foot(x) £ N p {Hq) satisfy 

d» 3 us(g,h) < e|x| 

for all word geodesies g, h from to x. 

Proof. Let S be a finite generating set for H(Z) and give H(R) the corresponding 
CC metric. Let T > be large enough so that when |x| > T and Foot(x) £ Af p (N ) 
and g is a word geodesic from to x, then there is a CC geodesic 7 from to x such 
that diHaus(ffj7) < f \g\ = fl x l- Since regular points have unique CC geodesies, we are 
done. □ 
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Then since the proportion of A<S reg lying over jVp(No) goes to zero as p -*■ 0, we 
can conclude from the point of view of geometric probability that all of the regular 
part of H(M.) is e-linearly stable for every e, so that Hausdorff distance between word 
geodesies is bounded above by every linear function of word length. That is, for 
regular points of H(Z), word geodesies satisfy a sublinear fellow traveling property. 
This is much stronger control than one has in Z d , where only words that are powers of 
single generators can be e-stable for every e, so almost all points are unstable. At the 
other extreme, all elements sufficiently far from the basepoint in a hyperbolic group G 
are stable. The Heisenberg group is intermediate between these in terms of geodesic 
stability: only the regular part is e-stable for all e, and its measure depends nontrivially 
on the generators. 

To sum up this comparison: 



Theorem 22 (Instability Theorem). With respect to any finite generating set, 

0, x = z d , 

V reg /V, X = H(Z), 

1, X unbounded, (5-hyperbolic. 



limProbfl(x) < e) 

e^o v v ' ' 



5. Lattice point counting 

Because the balls in the CC metric get "fat" (i.e., the volume of a 1-neighborhood of 
the sphere is lower-order than the volume of the ball), it is easy to see that the number 
of lattice points in a ball of large radius equals the volume of the ball (to first order). 
What is much more delicate is to count lattice points in an annular region between a 
sphere of radius n and a sphere of radius n — 1. 

As before, the footprint of the limit shape, Q, is covered by finitely many quadrilat- 
erals Qij. For a subset U of H(R), let A n U = {S t (U) ■ n - 1 < t < n} be the "annular" 
region between the (n - l)st and nth dilate of U. 

Recall the functions z and A described in Theorem [lOj These functions are related 
by A = maxz; that is, A is the height (maximum z-coordinate) of the unit sphere in the 
CC metric over points in Q, or equivalently, it is the maximum balayage area of curves 
of L-length 1 from the origin to (x,y). We can compute the height of the CC sphere of 
radius n above (x, y) e nQ using the rule H n (x, y) = n 2 -A( x /n, f/n). (Note that for fixed 
(x,y) and large enough n, this is continuous and strictly increasing in n.) The unit 
ball is compact, so A takes a maximum value A max on Q (the largest height achieved 
by the unit sphere). Then for fixed (a;,y), the height function H n (x,y) is bounded 
above by A max ■ n 2 . But this is not by itself good enough to get control on the height 
difference between the n-sphere and the (n - l)-sphere over a point, h n := H n - i? n _i. 

Lemma 23 (Control on height). Take a polygonal CC metric on H(R) and let h n (x, y) 
be the height difference between the spheres of radius n and (n - 1) over (x,y) for 
(x,y) e (n - 1)Q. Fix any nondegenerate quadrilateral Qij and let VL n c nQij denote 
the complement of the 1-neighborhood of the boundary. Then h n = 0(n) over VL n . 

Proof. First of all, for x e f2„, one easily verifies that both ^x and -^rX lie in Qij. 
Recall from the structure theorem (Thm |Tu] ) that the nonnegative part of S over 
Qij is the graph of a single quadratic polynomial in x,y ; denote this polynomial by 
f(x, y) = a\X 2 + a^xy + a^y 2 + b\x + 622/ + c - F° r ai l (x, y) 6 ^n, we have 

(x,y, n 2 -f{l,y-)) e 5 n (Panel y ) 
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SO 

h n (x,y) = n 2 -f(x/n,y/n) - (n - l) 2 -/(*/n-i,«/n~i) = b x x + b 2 y + 2cn - c. 
Since (x, y) e fi„ c nQij c nQ, both a: and y are O(n), and therefore h n is as well. □ 

Theorem 24 (Counting Theorem). Consider the CC metric induced on H(R) by a 
norm \\-\\ L on m, with L a polygon. Let a be an arbitrary measurable subset of S, the 
CC unit sphere. Then 

# (H(Z) n A„er) = 4n 3 voI(ct) + 0(n 2 ). 

Proof. First we consider a c <S reg . For this case, it suffices to treat small subsets a 
which project to squares in m whose closures are contained in the interior of a single 
quadrilateral: 

U ■= 7r(cr) = (u, u + e) x (u, v + e) 

such that [it, u + e] x [v, v + e] c . Since the dilation 5 t is just a homothety on m, the 
dilate tU is a square for every t. The condition on the closure of U guarantees that 
when n is large enough, nil does not intersect the 1-neighborhood of the boundary of 

Consider the rectangle U n ■■= nil n (n - 1)U, let A n ■■= A„er n 7r _1 ([/„) be the part 
of the nth annular shell that is vertically over U n , and let B n := A n a \ A n be the rest 
(see Figure [5]). 





(J ^/^^^^ 












u u n 



Figure 5 . This shows an xz-plane slice, with J7 n along the £-axis, A n 
in blue, and B n in red. We have A n uB n = A n a; we want to count the 
lattice points in that entire region. The A n lattice points are easier to 
count, and the number in B n is lower-order. 



We will make the following chain of comparisons: 

# (H(Z) n A n a) - # (H(Z) n A n ) ~ vol( A n ) ~ vol(A n a) ~ 4n 3 vo1(<t), 

while keeping track of the error term. 

First we will estimate the number of lattice points in J5 n , and its volume. The 
projection ir(B n ) is contained in a square annulus in m of width O(l) and sidelength 
O(n). Thus the projection contains at most O(n) lattice points (x, y). By the previous 
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lemma, the difference in heights between S n S and S n -iS over each of these points is at 
most O(n), so the total number of lattice points in B n is at most 0(n 2 ). Thus 

#(JT(Z) n A n a) = #(ff(Z) n A n ) + 0(n 2 ). 

Likewise, the volume of B n is bounded above by integrating h n (x,y) over (x,y) e 
n(B n ), so vol(B n ) = 0(n 2 ). This shows that vol(A n ) = vo1(A„ct) + 0(n 2 ). For the 
comparison on the far right, we have 

vo1(A„ct) = vol(S n a) - vo1(£„_ict) = [4n 3 + 0(n 2 )] voI(ct), 

because 5 t expands volume by t A . 

All that remains is to compare #(iJ(Z) nyl n ) with vol(A„). Fix attention on an 
integer point (x,y) e U n . For all the lattice points in the vertical line over (x,y), let 
us count the number in A„ct. We find #(if(Z) ni„) = (E(7„ n z 2 ^n) + 0(n 2 ), because 
nZ 2 ) = 0(n 2 ) and there is bounded error on each line. On the other hand, 
vo1(j4„) = fjj h n . Regard the sum as a Riemann sum approximation to the integral 
over the integer squares. Since h(n) is linear, the Riemann sum gives an equal error 
on each of the 0(n 2 ) squares, for total error of order n 2 . 

Next we deal with the case a c <S un5 , where it suffices to consider a = 7 x [h, h + e] 
with 7 a subinterval of an edge of L. Let £ = A7 denote the sector of m determined 
by 7, and let U n be the trapezoid in S between ny and (n - 1)7. We see that Act = 
Ut>o^7 x [ht 2 ,(h + e)t 2 ]. Let the height difference from top to bottom at (x,y) be 
denoted T(x, y) = e- 1| (x, y) || L . 

We shall make the comparison 

#(if(Z)nA„CT)~vol(A„CT) 

by considering the Riemann sum approximation of J v T by Z[/„nz 2 T. For any u, v € 
U n , the L-norm is between n- 1 and n, so T(v) - T(u) = e||v|| L - e||u|| L < 2en, which 
gives a bound on the error per term in the Riemann sum. Since U n has bounded width, 
its area and #Z 2 n U n are both 0(n). Putting these factors together, we get that the 
total error is 0(n 2 ), so we have shown that #(ff(Z) n A„ct) = 4n 3 voI(ct) + 0(n 2 ), as 
in the regular case. □ 

6. Limit measure in the standard generators 

To get more precise results that allow averaging over spheres and not just balls, 
we focus attention on H(Z) with the standard generating set S = std = ±{ei,e2} and 
consider the word metric with respect to those generators. The goal for this section 
is to show for this special case that counting measure on discrete spheres in the word 
metric limits to cone measure (with respect to Heisenberg dilation) on the limit shape 
S. 

Theorem 25 (Limit measure for standard generators). Consider the CC metric in- 
duced by the norm on m, which is the limit metric for (iJ(Z), std) . For any mea- 
surable set a c S, 

#(S n n Act) = n 3 voI(ct) + 0(n 2 ), 

and therefore 

Hm #(S n n Act) = voI(ct) 
«™ #5„ vol(«S) ' 
Thus for the discrete spheres S n in (i?(Z),std), the counting measure on Si/ n (S n ) 
converges to cone measure on the square L c m. 
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To prove this, we will study the distribution of points in S n , finding that the set 
S n has points on the line {(a;, y, t) : t > 0} if and only if x, y are integers and x + y + n 
is even; in that case it contains all lattice points between 5 n (S) and 5 n -2(S). This is 
illustrated in Figure [6] 




Figure 6. This figure depicts a cross-section of H(M), with the points 
of the sphere Si 6 in (i?(Z),std) marked and three dilates of the 
CC sphere shown for comparison. As will be proven in this section, 
S n has very nearly as many points as the lattice points contained in 
A n S (the shaded region): it has twice the points on half the lines. 



Lemma 26 (cc distance versus height). For any polygonal CC metric, distance from 
the origin is an increasing function of height: regarded as a function oft>0, the value 
d C c((x : y,t),0) is continuous, stays constant on an interval [0, T(x, y)], and is strictly 
increasing thereafter. 

Proof. Let x f = (x, y, t). The distance of a point x from is the value d such <5i/d(x) e S. 
If x t e A Suns, then the dilate of x t that hits S will intersect a side panel, so the distance 
of x t = (x,y,t) to the origin depends only on x, y. Now AS uns is a closed set with 
interior which contains (x,y,0) for every (x,y) e m. Thus (x,y,t) e AS uns for a closed 
interval of times [0, T(x, y)]. 

Now consider x t and x s with s > t. These have the same projection to m, and so 
do <5 r (x t ) and <5 r (x s ) for any r. Thus if <5i/ c i(x t ) e <S, we must have <5i/d(xs) vertically 
above S, meaning that d cc (x s , 0) > d. □ 

Note also that if a is a path in a Cayley graph for H(Z) (i.e., a word written in 
terms of the generators), then there is a corresponding admissible path in H(M) which 
ends at the same point and whose CC length is the same as the word-length in that 



word metric. (See the proof of Lemma 17 for the construction, noting that the extra 



height created by generators is zero for S = std, or whenever the generators lie in m.) 
That is, word paths are realizable as admissible paths in an obvious way, as in Figure[7] 
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Figure 7. Comparing geodesies: x = (4,0,3) is reached by a three- 
sided CC geodesic of length 572, while its word-length in the standard 
generators is 6. 

However, the CC geodesic in H(R) between two points in H(Z) is often shorter than 
the geodesic in the word metric, as seen in the figure: the CC geodesies need not have 
corners at integer points, even if they begin and end at integer points. 

Lemma 27 (Word length versus height). For any (x,y) e Z 2 , let e = e(x,y) = 1/2 
if x,y odd and e = otherwise, so that (x,y,e + m) e £f(Z) for all m e Z. Then 
\(x, y,e + m)| st d is nondecreasing for m = 0, 1, 2, 

Proof. Suppose without loss of generality that (x, y) is in the first quadrant of m. Let 
g be a geodesic in (i? (Z), std) from the origin to a point x = (x, y, z) where z > 0. Then 
z is equal to the signed area enclosed between g and the straight chord from to 7r(x). 
The path contains only ei moves and e2 moves, and there must be some subword eie2 
or else the signed area would be nonpositive. But then let g' be the same path with 
some eie2 subword replaced by e 2 ei. This has the same length and its balayage area is 
one less, so it is a path in H(Z) from to (x, y, z - 1). This may no longer be geodesic, 
but its length provides an upper bound: \(x, y, z - 1)| < \(x, y, z)\. □ 

Next, it is quite easy to see that there is a parity condition on lengths of paths from 
the origin to (x,y,z). 

Lemma 28 (Parity). For any path of length n in H(Z) from to (x,y,z), n has the 
same parity as x + y. 

Proof. All paths in (iJ(Z),std) are of the form Ile^e^, with = x, Y,h = y, and 
length n = Y, \ a i\ + \h\- Since r + |r| is even for all integers, it follows that x + y + ne 2Z. 

Geometrically, this is just the observation that if a path in m to (x, y) goes hori- 
zontally past x or vertically past y, it will have to backtrack by the same number of 
steps. □ 

For fixed (x,y) e Z 2 and neN, let us count the points (x,y, z) e S n . We first note 
that Tr(S n S) = nQ, so there are no points of S n over (x,y) if ||(a;, > n. 

Theorem 29 (Word length in terms of CC ). Fix (x,y) e Z 2 . For z>0 inZ + e, let n 

be the unique integer with the same parity as x + y such that n- 2 < d cc ((x, y,z),0) < n. 
Then 

\(x,y,z)\ std = n. 

Stated another way, this formula says that for all x e H(Z), 

d cc (x, 0) < |x| std < d cc (x, 0) + 2, 

and describes which of the two integers in that range is the correct value. 

Proof. We will restrict ourselves to the half-space z > without loss of generality; the 
situation on the other half-space is obtained by reflection. The point (x, y, e + m) lies in 
the unstable part of H(R) for < m < T, then in a sector with three-sided combinatorics 
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for some T < m < T' , and finally in a sector with four-sided combinatorics for m>T'. 
(See Figure [I]) We consider those three cases separately. 

Fix n > of the same parity as x + y. By Lemma |26[ there is a unique real value 
r > such that (x,y,r) is reached by a CC geodesic of length n. Take that r and 
suppose that the geodesic 7 is three-sided. Note that ir(j) is a path in m moving only 
horizontally and vertically; without loss of generality, 7 has the form e^ef e^ + , where 
t > and x + 2t + y = n. But we know x + y has the same parity as n, so it follows that t is 
an integer. Thus 7 is the realization of a word geodesic, so \{x,y,r)\ = d cc ((x,y,r),Q). 

Now assume that 7 is a 4-sided geodesic from the origin to (x,y,r), with length 
n. Without loss of generality, 7 has the form e%~ x ~ e^'e^'e^ 4 , where t > and 
3x - y + At = n. Since x + y + n is even, it follows that At is an integer with the same 
parity as Ax, so t is either an integer or a half-integer. If t is an integer, then 7 is the 
realization of a word geodesic and \(x, y,r)\ = d cc ((cc, y, r),0). If not, then let s = t- 1/2 
and u = t+ 1/2 be the nearest integers. We can form an integer path e|~ :r ~ ll ef +s e| +ll e^ ;s 
of length n by shortening the horizontal sides and lengthening the vertical sides by 1/2. 
This encloses area z = (x + s)(x + u) - \xy, which is in e + Z, so the endpoint (x,y,z) of 
the new path is in i/(Z). This area is smaller than r = (x + t) 2 - ^xy by 1/4, so it must 
be the nearest lattice point. This new path may not be a geodesic, but it establishes 
the inequality \(x,y,z)\ < n. On the other hand, shortening all four sides of 7 by 1/4 
produces a 4-sided CC geodesic to (x, y, r') of length n - 1, and one easily checks that 
r' < z < r. This shows that d cc ((x, y, z), 0) > n - 1, and it follows that \(x, y,z)\ = n. 

In these cases, we have established that the highest lattice point over (x,y) inside 
the closed CC ball of radius n has word-length exactly n. Now we want to show that 
the lowest lattice point over (x, y) outside the closed CC ball of radius n - 2 also has 
word-length exactly n. But this point has CC norm larger than n - 2, so it has word- 
length larger than n-2, and by parity considerations the word- length must be exactly 
n. 

The last case is easy: for any unstable point (x,y,z) e H(R), its CC norm is 
d cc ((x, y, z), 0) = x + y. We claim that these points satisfy |(x, y, z)| 5 td = x + y as 
well. To see this, just note that starting with the two-sided path efe^, which encloses 
area \xy with length x + y, one can shave off area one unit at a time with generator 
swaps (replacing some occurrence of eie2 with e2ei) until the area enclosed is z. This 
produces a word path whose length matches the CC geodesic, so it too is geodesic. □ 

Thus far, we have shown that if a is a subset of <S reg , then \S n n Act| ~ \H(Z) n A n cr|, 
because S n has twice as many points as A„er, but on half of the lines (see Figure |6|. 
On the other hand, \S n n Aa\ = \H(Z) n A n cr\ exactly for a c 5 uns . Putting these 



together and then using the Counting Theorem (Thm 24) to estimate the number of 



lattice points in A n a, we obtain Theorem 25 
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